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As the gap between processor and nnennory speeds continues to widen, methods for 
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becoming increasingly important. One such method, trace-driven memory 
simulation, has been the subject of intense interest among researchers and has, as 
a result, enjoyed rapid development and substantial improvements during the past 
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Embedded system programs tend to spend much time in small loops. Introducing a 
very small loop cache into the instruction memory hierarchy has thus been shown 
to substantially reduce instruction fetch energy. However, loop caches come In 
many sizes and variations — using the configuration best on the average may 
actually result in worsened energy for a specific program. We therefore Introduce a 
loop cache exploration tool that analyzes a particular program's profile, rapidly 
explores the possib ... 
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A Laboratory Support System (LSS) is provided in all No. lA Electronic Switching 
System (ESS) test laboratories to support the ESS software development process. 
The LSS tools are utilized by the developers to accomplish ESS program database 
management, lab load administration, lab load generation, and program testing and 
debugging in the laboratory environment. This document describes one of the LSS 
testing and debugging tools: the No. lA ESS Laboratory Erasable Flag System. This 
facility I ... 
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A new microprogrammable computer with low-level parallelism was built and has 
been utilized as a research vehicle for solving different classes of research-oriented 
applications such as real-time processings on static/dynamic images, pictures and 
signals, and emulations of both existing and virtual machines including high 



(Intermediate) level language nnachines. The design goal of a research-oriented 
computer, QA-1, was to achieve a high degree of processing power and system 
flex! ... 

8 Task scheduling and real-time: Extending STI for demanding hard-reaMime 
systems 

Benjamin Welch, Shobhit Kanaujia, Adarsh Seetharam, Deepaksrivats Thirumalai, 
Alexander G. Dean 

October 2003 Proceedings of the international conference on Compilers, 
architectures and synthesis for embedded systems 

Full text available: ^pdf(618.51 Additional Information: full citation , abstract , references , index 
KB) terms 

Software thread integration (STI) is a compilation technique which enables the 
efficient use of an application's fine-grain idle time on generic processors without 
special hardware support. With STI, a primary function (with real-time 
requirements on specific instructions) is automatically Interleaved with a secondary 
function to create a single implicitly multithreaded function which minimizes context 
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Ennbedded systems generally interact In some way with the outside world. This may 
involve measuring sensors and controlling actuators, communicating with other 
systems, or Interacting with users. These functions impose real-time constraints on 
system design. Verification of these specifications requires computing an upper 
bound on the worst-case execution time (WCET) of a hardware/software system. 
Furthermore, it Is critical to derive a tight upper bound on WCET in order to make 
efficient u . . , 
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Focusing on embedded applications, scratchpad memories (SPMs) look like a 
best-compromise solution when taking into account performance, energy 
consumption and die area. The main challenge in SPM design Is mapping memory 
locations to scratchpad locations. This paper describes an algorithm to optimally 
solve such a mapping problem by means of Dynamic Programming applied to a 
synthesizable hardware architecture. The algorithm works by mapping segments of 
external memory to physically partitioned b ... 

Keywords: design automation, dynamic programming, embedded design, memory 
hierarchy, partitioning algorithm, power saving, scratchpad memory 
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We developed a new hierarchical modular approach for synthesis of area-minimal 
core-based data-intensive systems. The optimization approach employs a novel 
global least-constraining most-constrained heuristic to minimize the instruction 
cache misses for a given application, instruction cache size and organization. Based 
on this performance optimization technique, we constructed a strategy to search 
for a minimal-area processor core, and an instruction and data cache which satisfy 
the performance ... 

Keywords: application-specific system-level synthesis, system modeling and 
performance evaluation, cache line coloring 
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Diagnostic Emulation is the application of microprogramming to the emulation of an 
operational computer to support software development and verification for that 
computer. A conventional technique. Interpretive Computer Simulation (ICS), has 
been used for many years in support of such software development and verification 
efforts. The ICS method is becoming less cost effective. For the development of 
attitude control software for NASA's High Energy Astronomical Observatory (HEAO) 
diagnostic ... 
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As computing systems and their applications become progressively more complex, 
the prediction of the performance of a proposed computer system or the effects of 
changes to an existing system likewise becomes more difficult. Fortunately there 
has been steady growth in performance evaluation technology to meet these 
expanding systems analysis requirements. The most desirable predictive tools 
continue to be analytical mathematical models which usually provide a 
comprehensive theoretical unders ... 
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The ratio of cache hits to cache misses in a computer system is, to a large extent, 
responsible for its characteristics such as energy consumption and performance. In 
recent years energy efficiency has become one of the dominating design 
constraints, due to the rapid growth In market share for mobile 
computing/communication/internet devices. In this paper we present a novel fast 
constructive technique that relocates the instruction code In such a manner into the 
main memory that the cache is util ... 
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This paper presents two topics: Implementation of a debugger through use of an 
incremental compiler, and techniques for fine-grained incremental compilation. 
Both the debugger and the compiler are components of the highly Integrated 
programming environment DICE (Distributed Incremental Compiling Environment) 
which alms at providing programmer support in the case where the programming 
environment resides in a host computer and the program. Is running on a target 
computer that is connected to the ... 
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The effectiveness of garbage collectors and leak detectors in identifying dead 
objects depends on the accuracy of their reachability traversal. Accuracy has two 
orthogonal dimensions: (i) whether the reachability traversal can distinguish 
between pointers and nonpointers {type accuracy), and (ii) whether the reachability 
traversal can identify memory locations that will be dereferenced in the future 
{liveness accuracy). This article presents an experimental study of the impo ... 

Keywords: Conservative garbage collection, leak detection, liveness accuracy, 
program analysis, type accuracy 
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Manual tuning tecliniques are widely applied but are generally slow, costly and 
require a great deal of expertise. This paper addresses the problem of automatically 
tuning the virtual architecture of a microprogrammed computer by 
microprogramming techniques. Two algorithms are presented to automate the 
tuning process. The algorithms are Implemented on the same dynamic 
microprogrammed computer that executes the given application. After execution of 
the program the algorithms are invoked and a ... 
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Current implementations were found to be impractical for airborne use due to 
LISP'S incompatability with conventional computer architectures. Direct execution 
of LISP with tasks distributed between three processors, seemed to be a workable 
solution. The language was analyzed, and a special token was devised, using a 
descriptor with a single pointer. Through careful distribution of responsibilities, 
control and data flow between the processors was minimized. Significant memory 
savings resulted fr ... 
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